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BACKGROUND OF THE INVENTION 
FIELD OF THE INVENTION 



This invention relates to systems and methods for transmission of still images 
over relatively low-speed communication channels. More specifically the invention 
relates to progressive image streaming over low speed communication lines, and may 
be applied to a variety of fields and disciplines, including commercial printing and 
25 medical imaging, among others. 

BRIEF DESCRIPTION OF THE PRIOR ART 

30 In a narrow bandwidth environment, a simple transfer to the client computer of 

any original image stored in the server's storage is obviously time consuming. In many 
cases the user only wishes to view a low resolution of the image and perhaps a few 
more high-resolution details, in these instances it would be inefficient to transfer the 
full image. This problem can be overcome by storing images in some compressed 

35 formats. Examples for such formats are standards such as Progressive JPEG (W. 
Pennebaker and J. Mitchel, "JPEG, still image data compression standard", VNR, 
1993) or the upcoming JPEG2000 (D. Taubman, "High performance scalable image 
compression with EBCOT", preprint, 1999). These formats allow progressive 
transmission of an image such that the quality of the image displayed at the client 

40 computer improves during the transmission. 
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In some application such as medical imaging, it is also necessary that 
whenever the user at the client computer is viewing a portion of the highest resolution 
of the image, the progressive streaming will terminate at lossless quality. This means 
that at the end of progressive transmission the pixels rendered on the screen are exactly 
the pixels of the original image. The current known "state-of-the-art" wavelet 
algorithms for progressive lossless streaming all have a major drawback: their rate- 
distortion behavior is very inferior to the "lossy" algorithms. The implications are 
serious: 

1 . Whenever the user is viewing any low resolution of the image (at low 
resolutions the term "lossless" is not well defined) more data needs to be sent 
for the same visual quality. 

2. During the progressive transmission of the highest resolution, before lossless 
quality is achieved, more data needs to be sent for the same visual quality. 

Researchers working in this field are troubled by these phenomena. As F. Sheng, A. 
Bilgin, J. Sementilli and M. W. Marcellin say in [SBSM]: . .Improved lossy 
performance when using integer transforms is a pursuit of our on-going work." Here is 
an example: 



Wavelet 


Rate (bit per pixel) 


0.1 


0.2 


0.5 


0.7 


1.0 


Floating Point 7x9 


24.18 


26.65 


31.64 


34.17 


36.90 


Reversible (4,4) 


23.89 


26.41 


31.14 


33.35 


35.65 



Table 1 - Comparison of the lossy compression performances 
(implemented by the (7,9) Wavelet) to a lossless compression 

(implemented by a reversible (4,4) Wavelet) of "Barabara" image 
(PSNR (dB)) ([SBSM]). 
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5 As one can see from Table 1, state of the art progressive lossless coding is 

inferior to lossy coding by more than 1 dB at the high bit-rate. 

Indeed, intuitively, the requirement for lossless progressive image transmission 
should not effect the rendering of lower resolutions or the progressive "lossy" rendering 
of the highest resolution before lossless quality is obtained. The final lossless quality 
10 should be a layer that in some sense is added to a lossy algorithm with minor (if any) 
effect on its performance. 

The main problem with known lossless wavelet algorithms, such as SPIHT 
[SP1] and CREW [ZASB], is that they use special "Integer To Integer" transforms (see 
"Wavelet transforms that map integers to integers", A. Calderbank, I. Daubechies, W. 
15 Sweldens, B. L. Yeo, J. Fourier Anal. Appl., 1998). These transforms mimic 
Q "mathematically proven" transforms that work well in lossy compression using 

|r j floating-point arithmetic implementations. But because they are constraint to be 

fi lossless, they do not approximate their floating-point ancestors sufficiently well. 

W Although in all previous work there have been attempts to correct this approximation in 

S j 20 the progressive coding stage of the algorithm, the bad starting point, an inefficient 

transform, prevented previous authors to obtain decent rate-distortion behavior. 
45 Our algorithm solves the rate-distortion behavior problem. Using the fact that 

images are two-dimensional signals, we introduce new 2D lossless Wavelet transforms 
that approximate much better their lossy counterparts. As an immediate consequence 
25 our lossless progressive coding algorithm has the same rate-distortion of a lossy 
algorithm during the lossy part of the progressive transmission. 

SUMMARY OF THE INVENTION 

The imaging system that is described below is directed to a lossless image 
30 streaming system that is different from traditional compression systems and overcomes 
the above problems. By utilizing a lossless means of progressive transmission means 
the pixels rendered on the screen at the end of transmission are exactly the pixels of the 
original image that was transmitted. The imaging system disclosed herein eliminates 
the necessity to store a compressed version of the original image, by streaming ROI 
35 data using the original stored image. The imaging system of the present invention also 
avoids the computationally intensive task of compression of the full image. Instead, 
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once a user wishes to interact with a remote image, the imaging server performs a fast 
preprocessing step in near real time after which it can respond to any ROI requests also 
in near real time. When a ROI request arrives at the server, a sophisticated progressive 
image-encoding algorithm is performed, but not for the full image. Instead, the 
encoding algorithm is performed only for the ROI. Since the size of the ROI is bounded 
by the size and resolution of the viewing device at the client and not by the size of the 
image, only a small portion of the full progressive coding computation is performed for 
a local area of the original image. This local property is also true for the client The 
client computer performs decoding and rendering only for ROI and not the full image. 
This real time streaming or Pixels-On-Demand™ architecture requires different 
approaches even to old ideas. For example, similarly to some prior art, the present 
imaging system is based on wavelets. But while in other systems wavelet bases are 
selected according to their coding abilities, the choice of wavelet bases in the present 
imaging system depends more on their ability to perform well in the real time 
framework. The system of the present invention supports several modes of progressive 
transmission: by resolution, by accuracy and by spatial order. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a system architecture block diagram. 
Figure 2 is an imaging system workflow diagram. 

Figure 3 is a flow diagram representing a "lossless progressive by accuracy" 
request list for a ROI. 

Figure 4 is a diagram depicting the client "progressive by accuracy" workflow. 
Figure 5 is a diagram depicting the server workflow. 
Figure 6 is a diagram describing the server-preprocessing step. 
Figure 7 is a diagram describing the low resolution encoding process. 
Figure 8 is a diagram describing the ROI high resolution processing. 
Figure 9 is a diagram depicting the local forward lossless wavelet transform. 
Figure 10 is a diagram depicting the local inverse lossless wavelet transform. 
Figure 1 1 is a diagram depicting the progressive rendering steps. 
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Figure 12 is a diagram depicting a lossless subband tile wherein the spatial 
grouping of subband coefficients are at a given resolution and the halfbit matrix is 
associated with the hhj subband. 

Figure 13 a diagram depicting the RGB <-> YUV reversible conversion. 

Figure 14 a diagram depicting a lossless last bit plane data block in which only 
hi and hh subband coefficients are scanned. 

Figure 15 is a sample pseudo-code of the encoding algorithm represented by: (a) 
Least significant bit plane scan pseudo-code (b) Halfbit plane scan pseudo-code. 

Figure 16 is a sample pseudo-code of the decoding algorithm represented by: (a) 
Least significant bit plane scan pseudo-code (b) Halfbit plane scan pseudo-code. 

Figure 17 is a diagram depicting the curve defining the mapping from 
Original_Image_Depth-bit image to Screen_Depth-bit. 

Figure 18 is a diagram depicting the decomposition of one-dimensional signal 
x to the Low-subband s and the High-subband d and the separable decomposition of 
two-dimensional signal X into 4 matrices (subbands): LL,HL,LH and HH . 

Figure 19 is a diagram depicting the first stage of the 2D separable transform 
step in the X-direction. 

Figure 20 is a diagram depicting the second stage of the 2D separable transform 
step in the Y-direction. 

Figure 21 is a diagram depicting the application of the full 2D Wavelet 
transform. 

Figure 22 is a flow diagram representing the least significant bit plane scan of 
the encoding algorithm. 

Figure 23 is a flow diagram representing the least significant bit plane scan of 
the decoding algorithm. 

Figure 24 is a flow diagram describing a bit plane significance scan of the 
server-encoding algorithm. 

Figure 25 is a flow diagram describing the subband tile extraction of the client 
progressive rendering process. 

Figure 26 is a diagram depicting the preprocessing multi-resolution structure. 
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DETAILED DESCRIPTION OF THE INVENTION 

1. NOTATION AND TERMINOLOGY 

The following notation is used throughout this document. 



Term 


Definition 


4D 


Four dimensional 


FIR 


Finite Impulse Response 


FWT 


Forward Wavelet Transform 


GUI 


Graphical User Interface 


ID 


Identification tag 


IWT 


Inverse Wavelet Transform 


ROI 


Region Of Interest 


URL 


Uniform Resource Locator 


LSB 


Least Significant Bit 


RMS 


Root Square Error 


FP 


Floating Point 


PDF 


Probability Distribution Function 



The following terminology and definitions apply throughout this document. 



Term 


Definition 


Rendering 


Procedure to output/display a ROI of an image into a device such 
as a monitor, printer, etc. 


Multiresolution 


A pyramidal structure representing an image at dyadic resolutions, 
beginning with the original image as the highest resolution. 


Subband Transform / 
subband coefficients 


A method of decomposing an image (time domain) to frequency 
components (frequency domain). A representation of an image as 
a sum of differences between the dyadic resolutions of the image's 
multiresolution. 


Wavelet Transform / 
Wavelet coefficients 


A special case of Subband Transform. 


Progressive 
Transmission 


Transmitting a given image in successive steps, where each step 
adds more detail to the rendering of the image 


Progressive Rendering 


A sequence of rendering operations, each adding more detail. 


Progressive by accuracy 


Transmit/render strong features first (sharp edges), less significant 
features (texture) last 


Progressive by 
resolution 


Transmit/render low resolution first, high resolution last 


Progressive by spatial 
order 


Transmit/render top of image first, bottom of image last 


Distributed database 


A database architecture that can be used in a network environment 
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Subband/Wavelet tile 


A group of subband/wavelet coefficients corresponding to a time- 
frequency localization at some given spatial location and 
resolution/frequency 


Subband/Wavelet data 
block 


An encoded portion of a subband/wavelet tile that corresponds to 
some accuracy layer 



2. OVERVIEW OF THE INVENTION 

Referring to Figure 1, a block diagram is provided depicting the various 
components of the imaging system in one embodiment. A client computer 110 is 
coupled to a server computer 120 through a communication network 130. 

10 In one embodiment, the client computer 1 10 and server computer 120 may 

comprise a PC-type computer operating with a Pentium-class microprocessor, or 
equivalent. Each of the computers 110 and 120 may include a cache 111,121 
respectively as part of its memory. The server may include a suitable storage device 
122, such as a high-capacity disk, CD-ROM, DVD, or the like. 

15 The client computer 110 and server computer 120 may be connected to each 

other, and to other computers, through a communication network 130, which may be 
the Internet, an Intranet (e.g., a local area network), a wide-area network, or the like. 
Those having ordinary skill in the art will recognize that any of a variety of 
communication networks may be used to implement the present invention. 

20 With reference to Figure 2, the system workflow is described. Using any 

browser type application, the user of the client computer 110 connects to the Web 
Server 140 or directly to the Imaging server 120 as described in Figure 1. He/she then 
selects, using common browser tools an image residing on the Image file storage 122. 
The corresponding URL request is received and processed by the Imaging Server 120. 

25 In case results of previous computations previous computations on the image are not 
present in the Imaging Cache 121, the server performs a fast preprocessing algorithm 
(see §2.1) in a lossless mode. The result of this computation is inserted into the cache 
121. Unlike other more "traditional" applications or methods which perform full 
progressive encoding of the image in an "offline" type method, the goal of the 

30 preprocessing step is to allow the server, after a relatively fast computational step, to 
serve any ROI specified by the user of the client computer. For example, for a 15M 
grayscale medical image, using the described (software) server, installed on computer 
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5 with a Pentium processor, fast disk, running Windows NT, the preprocessing step 501 
will typically take 3 seconds. This is by order of magnitude faster than a "full" 
compression algorithm such as [S], [SP1], [T]. Serving ROI is sometimes called 
"pixels-on-demand" which means a progressive transmission of any ROI of the image 
in "real-time", where the quality of the view improves with the transfer rate until 

10 lossless view is received in the client side. Once the preprocessing stage is done, the 
server sends to the client a notification that the "image is ready to be served". The 
server also transmits the basic parameters associated with the image such as 
dimensions, color space, etc. Upon receiving this notification, the client can select any 
ROI of the image using standard GUI. The ROI is formulated in step 203 into a request 

15 list that is sent to the server. Each such request corresponds to a data block (§4). The 
order of requests in the list corresponds to some progressive mode selected in the 
context of the application such as "progressive by accuracy" rendering of the ROI. 
Upon receiving the ROI request list, the server processes the requests according to their 
order. For each such request the server checks if the corresponding data block exists in 

20 the Cache 121. If not, the server then computes the data block, stores it in the cache and 
immediately sends it to the client. Once a data block that was requested arrives at the 
client, it is inserted into the cache 1 1 1 . At various points in time during the transfer 
process, a decision rule invokes a rendering of the ROI. Obviously, if some of the data 
blocks required for a high quality rendering of the ROI, were requested from the server, 

25 but have not arrived yet, the rendering of the ROI will be of lower quality. But, due to 
the progressive request order, the rendering quality will improve with each received 
data block in an "optimal" way. In case the user changes the ROI, the rendering task at 
the client is canceled and a new request list corresponding to a new ROI, sent from the 
client to the server, will notify the server to terminate the previous computation and 

30 transmission task and begin the new one. 



3. NEW REVERSIBLE WAVELET TRANSFORM 

We will now explain in detail why the rate-distortion behavior of our 
35 progressive lossless algorithm is better than other known algorithms. Lossless wavelet 
transform, must be integer-to-integer transform, such that round-off errors are avoided. 

8 
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In order to demonstrate the difference between lossy and lossless transforms, let us look 
at the simplest wavelet, the Haar wavelet. Let x(k) be the k -th component of the one- 
dimensional discrete signal x . The first forward Haar transform step, in its accurate 
"mathematical" form, is defined by: 
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^(w) =-r=(x(2« + l) + x(2n)), 

d(n) = -^ r (x(2n + l)-x(2n)), 
\2 



(3.1) 



where s is a low-resolution version of jc , and d is the "difference" between s and x . 
In the case of lossless transform, applying the above transform results in round-off 
error. One possibility is to apply the transform step suggested by [CDSI]: 



s(n) = 



x(2« + l) + x(2w) 



(3.2) 



d(n) = x {In + 1) - x (in) . 



The symbol |_°J is the floor function meaning "greatest integer less than or equal to o ", 
20 e.g. 

[0.5j = 0,[-1.5j = -l, L2j = 2,|_-lJ = -l. 

The one-dimensional transform step is generalized to 2D separable transform 
step, by applying the ID transform step twice, first in the X-direction and than (on the 
25 first stage output) in the Y-direction as described in Figure 18, 19 and 20 (see also [M] 
7.7). The full 2D Wavelet transform is applied by using the 2D Wavelet transform step 
iteratively in the classic Mallat decomposition of the image (Figure 21) ([M] 7.7). 



In (3.2) two properties are kept: 
30 1. Reversibility, i.e., one can restore x(2w)and x(2« + l), by knowing s(n)and 

as follows: 



9 
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x(2n) = s(n)- 



d(n) 
2 



(3.3) 

x(2n + l) = d (n) + x(2n). 

2. De-correlation, i.e., s(n) and d (n) contains the minimal number of bits 

required in order to follow property 1 . For example, if the transform would have 
been defined by: 

\s(n) = x(2n + l) + x(2n), ^ 
]d (n) = x(2n +l) -x(2«), 

then the least significant bit of s (n) and d (n) would have been the same and 
saved twice. In other words, there is a correlation between s(n) and d(n) in 

(3.4). From the view point of coding this should be avoided since there is a 
redundancy of transmitting this bit. 

On the other hand, the scaling property, that is a very important one, is not kept 
in (3.2). Observe that the value of s{n) computed by (3.2), is smaller than its "real 

mathematical value" as computed in (3.1), by factor of V2 . Since s(n) should be 
rounded to an integer number, the fact that s(n) is smaller than what it should be, 

increases the round-off error. In low resolutions, the error is accumulated through the 
wavelet steps. 

If we take the error as a model of t4 white noise" added to the i — th resolution in 
a multi-resolution representation of the image, i.e. X i in Figure 21, it can be proved 
that the variance of this noise exponentially increases as a function of z. This 
"contamination" to the multi-resolution image damages the coding efficiency at low 
bit-rates. Let us describe this in detail for the case of the Haar wavelet. We have two 
assumptions in our analysis: 

1. The parity (least significant bit) of an arbitrary coefficient c, in any of the 

wavelet steps is a uniformly distributed random variable, i.e. 

Pr(c=0 mod2)=Pr(c=l mod2) = -^. 
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2. This parity is independent of other coefficient's parity (Identically independent 
distributed). 

Our referenced computation, i.e. the accurate computation, is the Haar transform 
step defined in (3.1). We concentrate on the LL-subband coefficients, because the low- 
resolution subbands are computed from them. LL-subband coefficients are the result of 
a 2D-transform step (Figure 18) 

x x(2m + iqn + l) + x(2m,2n + l) + x(2m + l,2n) + x(2m,2n) 

where m and n are the indices of the row and column of the coefficient respectively. 

As described in Figure 18, 19 and 20, according to the step defined in (3.2), we 
first apply the X-direction step 



, . x(k,2n + \) + x(k,2n) 
s(k,n)= 

for each input row x (k, •) . 

Under assumption 1 mentioned above, we can write s^k^n) as 



(3.5) 



where e is a random variable with a probability distribution function (P.D.F.) 
p(-) defined by 



Therefore, 



/ >(-0.5) = Pr( e = -0.5) = | 
/ ,(0) = Pr(e = 0) = i. 



E(e) = --, Var(e) =— . 
w 4 W 16 



(3.6) 



(3.7) 
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We then apply the Y-direction transform by 



s (2m + 1, n) + s (2m, n) 



s (2m + l 9 n) + s (2m, n) 
= 2 — — + e '' ( 3 - 8 ) 



As in (3.5) we can represent s(2m + \ 9 n) and s(2m,n) by: 



( v x(2/w -h 1, 2w -hl) + x (2m + 1,2/2 ) 
? (2m + 1, wj = — h €|, 

, v x(2/w ? 2n + l) + x(2m 5 2n) 
i y(2/w ? n) = — * L + e 2 . 



(3.9) 
(3.10) 



Now we can write: 



// (CDSI) (m,n) 

x (2m + 1, 2n + 1) + x(2m + 1, 2«) x(2m, 2n + 1) + x (2m, 2n) 



+ e, +■ 



■ + e, 



x(2»j + l,2« + l) + jc(27M + l,2«) + x(2m,2« + l) + x(2w,2«) ^ e 2 

4 + y + y +e 



(3.H) 



x (2m + 1, 2« + 1) + x (2m + 1, 2w) + x (2m, 2w + 1) + x (2m, 2« ) 



where e x ,e 2 ,e % are independent (assumption 2 above) random variables with 

expectation — .Variance —.and e~ — + — + e* . 
4 16 2 2 



Therefore, 
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E (e)4 E te)4 E(e2)+E(e ' )=_ 2~' 

Var(e)=Var^+^- + c ,> | = ivar(ei)+^-Var(c 2 ) + Var(e , )= 



J 



4 



I _L I_L _L_ A 

4 16 4 16 16~32' 



Thus, 



, /(accurate) / \ 

//^(m,*) ^ >'*) + g . (3.13) 
e represents the approximation error of the LL-subband coefficients, results from one 
2D transform step. The error relates to the accurate floating-point computation. 

This was a description of a single 2D-transform step assuming that the input 
coefficients are without any error. Now we wish to evaluate the error accumulated after 
several steps. 

At an arbitrary step z>0, we can assume that an input coefficient can be 
written as: 

x i (kj) = x\~ \kj) + e i9 
where x ( accurate ) is the accurate value achieved by floating-point computation for all 
the previous steps, i.e., a step defined by 

\ x(2n + l) + x(2n) 

< S{H) ~ 2 9 (3.14) 

d(n) = x(2n + l)-x(2n) 9 

instead of the integer-to-integer computation in (3.2). Observe that if ^ MCurate ) (kj) is 
the i -th resolution image coefficient, using (3.14) as the ID Wavelet step, then 

4^>(jM) = ^"^*' 7 ) , />1, (3.15) 



where //(^ curate ) is the normalized (L 2 - norm ) LL-subband coefficient resulting 
from the i -th 2D transform step using (3.1) as the ID Wavelet step (see Figure ). e i is 
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the difference between x $ (A:,/) and x ( accurate ) (kj) (I.e., the approximation error of the 
integer computation made until now). E.g. e 0 = 0 ( x 0 (k, /) is an original image pixel), 

1 3 

while e x is a random number with expectation -— and variance — (see (3.12)). 
0. 



Using (3.11), we get: 



lf mi \m,n) 

x^ c) (2m + 1, 2#i + 1) + e\ + x ( (acc) (2 m + 1, 2n) + ef + ^ {acc) (2m, 2n + 1) + ^ + x (acc) (2m, 2w) + < 

— h £ 

4 

_ ^ (2m + 1, 2n + 1) + x^ (2m + 1, 2k) + jc'"" 0 (2m, 2n + 1) + jcf""* (2m, 2n) e]+e} + ef + 

~ 4 + ^ +< 

_ x\ we) (2m + 1, 2n + 1) + a*"* (2m + 1, 2w) + x^ (2m, 2n + 1) + ^ (2m, 2») 

" 4 + e w 

_ (accurate) 

— + , 



where e i+x is defined by 



12 3 4 

e- + er + e- + e 
e M 4 + 



and corresponds to LL. subband. 
Consequently 

E(e i+l ) = E( ei ) + E(e) 9 
Var(e fr O = ^)+Var(e). 

Observe that 

E(e)=-i Var( e ) = ^. 

As a result, we can write recursive formulas for the error expectation and variance after 
i steps. 
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E(^) = 0, 
E(e,. +1 ) = E(e,.)-i 

Var(e o ) = 0, 

Var(e,) 3 

Var(e ( . +1 ) = ^^ +— , 



(3.16) 



The explicit solutions to these formulas are 

E(e i ) = -\y a r(e i ) = - U- 

v l) 2 v ; 8 2-4' +1 



(3.17) 



77 (accurate) / \ 

t-v -1 • (accurate)/ \ • ,i "/-l [ffl^flj 

By replacing x> } \m,n) with - 



we get 



/// CDSI )(m,«) = 



inaccurate) / \ 

11) } (m^n) 



(3.18) 



Thus, the approximation to //( accurate ) is 

2 i+1 /// CDSI) fan) = fan) + 2 /+1 e. 

The approximation error expectation is 



E(2 i e ! ) = 2 i E(e i ) = 2 i (-^ = -i2 i - 



The approximation error variance and standard deviation are 



Var(2' e ,.) = 4 ! Var( e/ ) = 4'fi 1 



Hence 



Std(2 , e f ) = 1 JVar(2'e,.)« 



8 2-4' +1 
2 H 



4' - 1 4' 
— — = 2 2i '" 3 . 
8 8 



V2" 
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Let us now evaluate the approximation error of the 3 other subbands: 



^(cdsi) ^ m ^ n ^ _ 



//g DSI) (2m + l,2n + l) + /ff SI) (2m + l,2n) 



7/g DSI) (2m,2n + l) + //£ DSI) (2m,2n) 



j j (accurate) / \ 1 2 

lh\ >(m,n) | e\+e] t 
2'' 2 



^3 , 4 *\ 



^(accurate) 

2' +< 



where 

• l<k<4 are identical to the random variable whose expectation and 
variance are given in (3.17). 



• e ; = 



1 - - 2 r e 3 +e A ^ 



lh _e i +e i 



- + e- 



V 



• e y and are identical to the random variable whose expectation and variance 
are given in (3.7). 

Thus, 

E(eT) = 0, 



Var(ef)4 



1 1 1 

+ 



1 



8 2-4' +1 ' 8 2-4 ,+1 J 16 



lfl 1 1 1 



+- — 



■ + 



4^8 2-4' +1 ' 8 2-4 i+1 
1 1 



J_ 
16 



4 2-4' +1 

The approximation to lh\ &ccw ^ (m,n) is 



(3-19) 



27/*, (CDSI) (/«,«) = //*< aocurate) (m, n) + Xef . 
The approximation error variance and standard deviation are: 

Var(2<e- ) = 4' Var (ef) = 4' {~^) = 4- -I « 4". 

Therefore 
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Std(2'ef" ) = ^(I'ef") « 4^ = 2". 

A similar approximation error estimation, it can be done with the HL-subband and the 
HH-subband. 

The approximation error evaluation results are summarized in the following 
table where the error is the difference between the normalized (in L 2 - norm ) 
coefficients according to [CDSI] reversible transform and the "mathematical" 
transform (defined in (3.1)). 





Expectation 


Variance 


Std 


LL t - error 


-(/+l)2' 


A M _ 1 
8 


0.707-2'' 


LH i - error 


0 


2 - 4, '-^4M 
8 


0.5-2'' 


HL { - error 


-1.2' 
4 


3 - 4 '- 2 »3.4« 

16 


0.433-2'' 


HH- - error 


0 


4 '- 1 «2« 

8 


0.354-2'' 



Table 2 - Normalized (in L 2 - norm ) approximation errors of the 
Wavelet coefficients at resolution z>0 (z = 0 is the highest 
resolution) using the (CDSI) reversible Haar transform. 

Assuming a low-bit rate transmission where only the coefficients whose absolute 
value belongs to the range [2\2 6+1 ) are encoded, for every resolution z 3 where i is 

greater than b (less or more). It must be noted that the large error implies a significant 
loss of coding efficiency. 

Instead, we propose a new family of reversible transforms. The proposed family of 
integer wavelet transforms has all three properties: 

1 . Reversibility 
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2. De-correlation 

3. Scaling - i.e. improved approximation of the "mathematical" transform. 

Our 2D transform step is separable also, but the one-dimensional transform 
step, which the 2D transform is based on, is different for the X-direction (step 1901), 
the Y-direction step applied on the low output of the X-direction step (step 2001) and 
the Y-direction step applied on the high output of the X-direction step (step 2002) as 
described in Figure 18, 19 and 20. 

The full 2D Wavelet transform is applied by using the 2D Wavelet transform 
step iteratively in the classic Mallat decomposition of the image (Figure 21) ([M] 7.7). 
As mentioned before, the Wavelet coefficients in our proposed transform are all scaled, 
i.e. normalized in Z 2 -norm as the Wavelet coefficients computed in the accurate 
"mathematical" transform. 

In order to achieve the third property (improved approximation of the 
"mathematical" transform), we define an extra matrix we call the "Half bit-matrix" 
which enables the reversibility of the High Y-transform step (step 2002). The elements 
that belong to this matrix are bits, such that each bit corresponds to an HH-subband 
coefficient in the following interpretation. Let us describe this by the following 
example. 
Supposing 

s(n) = 7,d {l) (n) = 9 
are a coefficient pair results from a reversible de-correlated ID-wavelet step 

(n) = x(2n + l)-x(2n), 

, . x(2n) + x(2n + l) 

► = — — - 

W |_ 2 

Now, d® (n) has to be multiplied by ^ , in order to be scaled. 
The binary form of [n) = 9 is 

d {l) (n) = 1001 2 . 

If we now divide d^ l) (n) by 2 in a floating-point computation we get 



d FP (n) = ±d {l) (n) = 100 A 2 
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Let us call the bit, which located on the write-side of the floating point the "Half 
Bit". Observe that the Half Bit of d FP (n) is the LSB of d {l) (n) . Therefore, an 
equivalent way to do this in an integer computation without loosing the Half-Bit is to 
calculate first the LSB of d {l) (n) by 

Hal/Bit (n) = d {1) (n) mod 2 = 9 mod 2 = 1, 
then to shift-write (n) by 

d (n) = d (i) (n) » 1 = 1001 » 1 = 100. 
By saving d(n) and HalfBit{n) we can restore back d^(n). 

In the proposed transform, this Half-bit is needed in the HH-subband coefficient 
computation. Therefore in our wavelet decomposition for every HH-subband 
coefficient (in all scales) there is a corresponding bit, which is the coefficient's Half- 
bit. 

The Half bit matrix is hidden in the HH-subband in the description of Figure 18, 19 and 
20. It is described explicitly in the specification of the transform as much in the coding 
algorithm. 

We now present our integer-to-integer versions of the Haar transform and the 
CDF (1,3) transform for the 2-dimensional case. 



3.1 Reversible Haar and (CDF) (1,3) Transforms 

3.1.1Haar Transform 

With respect to Figure 19: 



3.1.1.1 Step 1901: X-direction 
Forward Step 



x(2n) + x(2n + l) 
2 



s(n) = 
d (n) = x(2n + l) -x(2n). 



(3.20) 
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Inverse Step 



x(2n) = s{n) — 



d(n) 



x(2n + l) = d{n) + x (2n) . 
With respect to Figure 20: 

3.1.1.2 Step 2001: Y-direction - Low Forward Step 

s(n) = x(2n) + x(2n + l), 



d {1) (n) = 



(3.21) 



(3.22) 



x(2n + lj-x{2n) 
2 

d(n) = 2d {1) (n). 
Remarks: 

1 . s ( n ) is a scaled LL-subband coefficient. 

2. s(n) and d® (n) are de-correlated and a reversible couple (can be transformed 
back to x(2n) and x(2w + l)) ? but <f^(«) is not scaled (it is half its "real 
value")- Thus, d {1) (n) is multiplied by 2. Nevertheless, the LSB of the LH- 
subband coefficient d (n) is known to be 0 and not encoded. 



Inverse Step 



x (In + 1) = {s ( n) + d (n) + (5 (/?) mod 2)) , 
x(2w) = .S'(/2)-x(2w + l). 



(3.23) 



With respect to Figure 20: 
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3.1.1.3 Step 2002: Y-direction - High Forward Step 

d {1) (n) = x(2n + \)-x(2n), 
Hal/Bit (n) = (d {1) («))mod2, 



(3.24) 



s(n) ~x{2n) + d{n). 

Remark: d®(n) and s{n) are de-correlated and reversible couples, but d®{n) is not 
scaled (It is twice its "real value"). Therefore, d^ {n) is divided by 2. By doing that, 

we lose its least significant bit, which cannot be restored. To solve this problem, as 
explained before, we save this bit as the "Half-Bit". Giving this name to that 

coefficient means that its weight is — in the "real mathematical scale", and it is the 

2 

least significant (from the approximation point of view). 
Inverse Step 

x(2n) = 



d {1) (n) = 2d(n) + HaljBit (n) , 
x(2n + l) = d {1) (w) + jc(2«). 



(3.25) 
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3.1.2 CDF (1,3) Transform 
3.1.2.1 Step 1901: X-direction 

With respect to Figure 19: 



Forward Step 



x(2n) + x(2n + l) 
2 



s(n) = 

d {1) (n) = x(2n + l)-x(2n) 
d(n) = d^(n) 



s(n-\)-s(n + l) 



Inverse Step 



</ (I) (») = <*(*)- 



x(2n) -s{n)- 



s{n-\)-s(n+\) 
4 

d (l) {n) 



x(2n + l) = x(2n) + d (l) (n). 



With respect to Figure 20: 
3.1.2.2 Step 2001: Y-direction - Low Forward Step 

s(n) = x{2n) + x(2n + \), 

s(n-\)-s{n + \) 



d (l \n)-. 



d(n) = 2d {l) (n). 



x(2n + l)-x(2n) + 



(3.26) 



(3.27) 



(3.28) 



Remark: See remarks for (3.22). 
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Inverse Step 

(o) = s(n)- 



s(n-l)-s(n + i) 
8 



x(2n + 1) = 1 (n) + d(n) + (s® (n) mod 2)), 



(3.29) 



x(2n) = s(n)-x(2n + l). 



With respect to Figure 20: 
3.1.2.3 Step 2002: Y-direction - High Forward Step 



x(2rc) + x(2«+l) 



tf (1) (/j) = x(2« + l)-x(2»), 

4 



rf (2) («) 



«/(») = 



HalJBit(n) = d {2) (n) mod 2. 



(3.30) 



Inverse Step 

d {1) (n) = 2d(n) + HalfBit («) - 



s(n-l)-s(n + l) 
4 



x(2n) = s(n)- 
x(2n + l) = d {1) (n) + x(2n). 



2 



(3.31) 



We now compute the approximation error probabilities of our method, and 
show that it is significantly smaller. We start with the LL-subband error. Assuming e. 
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is the approximation error of the LL-subband in the i - th resolution (Figure 21), the 
LL-subband coefficient in the i - th resolution can be written as: 

4 new) (2m + 1, 2n + 1) + ^ new) (2m, 2n + 1) 
2 



//( new) (w,n) = 



+ 



xf new) (2m + l,2») + 4 new) (2m,2») 



(new) , 



4 acc) (2/« + l,2w + l) + ^ 1 +4 acc) ( 2? "' 2w+1 ) + g » , 

2 



x^ (2m + 1, 2») + <t + x^ (2m, 2n) + e\ 



x( acc) (2m + 1, In + 1) + x; acc) (2m, 2n + 1) + xj acc) (2m + 1, 2n) + x| acc) (2m, 2n) 

2 ~ 



+ _< : i jH| LL + e ' +e « = 

2 



= l(r\m,n) + e t , 



(3.32) 



where 

• lll mw) (m,n) is the new transform LL-subband coefficient (z -th resolution). 

• e\_ x for 1 < A: < 4 , are identical random variable representing the error from the 
previous level. 

• e' and e n are random variables with P.D.F. defined in (3.6). 

• (m, n) is the i -th resolution image coefficient using (3.1) as the ID 
Wavelet step. 

• //} acc) (m, n) is the normalized ( L 2 - norm ) LL-subband coefficient resulting 
from the i -th 2D transform step using (3.1) as the ID Wavelet step (see Figure 
21). 



1 , 2 . 3 , 4 



&i 1 + e i 1 + ^-1 + e i-\ i ti 

• e.= — — — l — + e'+e" 

2 



Consequently 



4E( g ,„) 



+ 



/ , 4Var(e,,) 1 1 Tr , . 1 
Var (e ; ) = + — + — = Var (<?,._, ) + - 



4y 
16 16 



n = 2E(« M )~- 



(3.33) 
(3.34) 
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By knowing that e_ x = 0 we get 



Var(e,) = 



i + l 



(3.35) 



Now we can easily evaluate the approximation error of the 3 other subbands: 
//j( new ) ( OTjW ) 



= 2 



x{ new) (2m + 1, In + 1) + x< new) (2jw, 2w + 1) 




4 new) (2m + 1, In) + xf ew) (2m, 2«) 


2 




2 



(xj 3 ^ (2m + 1, In + 1) + x^ (2m, In + 1)) - (x{~> (2m + 1, 2«) + x^ (2m, 2» )) 

2 



+• 



M i-1 ;-l i-l . 1 _ II 



+ e'-e"+2e"' 



(3.36) 



where 

• /A} new ^ (m, «) is the new transform LH-subband coefficient ( i -th resolution). 

• /^ accurate) (m, «) is the normalized ( Z, 2 - norm ) LH-subband coefficient resulting 

from the i -th 2D transform step using (3.1) as the ID Wavelet step (see Figure 
21). 

• e is a random variable with P.D.F. defined in (3.6). 

12 3 4 

. e LH = g f -i + e M - e,_ x - e H ,_ H+ „, 
2 

• all other symbols are defined like in (3.32). 
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Hence 



Ef ^,_ 2E( gM )-2E( g ,._ 1 ) | 



1_ 

2' 



i + 3 



f ih\ 4Var(e. ,) 1 1 1 
v ' 4 16 16 16 8 



(3.37) 
(3.38) 



Similar estimation can be done for the HL and the HH subbands. 

The error estimation (for all subbands) are summarized in the following table 
where the error is the difference between the normalized (in L 2 -norm) coefficients 

according to our new reversible transform and the "mathematical" transform (defined 
in (3.1)). 





Expectation 


Variance 


Std 




- error 


1-2' 

2 


i + l 
8 


//+1 
V 8 


LH t 


- error 


1 

2 


i + 3 

8 


li + 3 
V 8 


HL t 


- error 


1 
4 


i 1 
8 16 




HH t 


- error 


1 
4 


i 1 
8 16 


V8 16 



Table 3 - Normalized (in L 2 ~norm) approximation errors of the 
Wavelet coefficients at resolution z>0 (z = 0 is the highest 
resolution) using the proposed reversible Haar transform. The 
result for the LL-subband is valid for the proposed reversible (1,3) 
transform also. 



The meaning of this result is that in a low bit-rate, where only large coefficients are 
encoded, this error is negligible. 
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4. IMAGING PROTOCOL AND DISTRIBUTED DATABASE 

Dividing the data into tiles and bit-planes 

For the purpose of efficient rendering the coefficients may be sub-divided into 
tiles. The tiles of this invention differ from previous art as shown in Figure 12. As in 
the lossy algorithm, here also the subband tiles are further decomposed to subband data 
blocks. Each data block of lossless subband tile (Figure 12) will have a 4D coordinate 

(t_x, tj, t_resolution,t_bitPlane) 
where 0 < t_bitPlane < maxBitPlane(t_resolution). 
Each data block contains the following data in encoded format: 

1. For t_bitPIane >2: 

a. A list of the indices of all subband coefficients whose absolute value is 

in the range [ 2 '- w ^ 2'- Wfa " ) . 

b. The sign of all the coefficients of a. 

c. For t_bitPlane >2, an additional precision bit for any coefficient that 
belongs to the current bit plane or any higher bit plane. 

2. For tJbitPlane =1, which we call the "least significant bit plane": 

a. A list of the indices of HL-subband and HH-subband coefficients whose 
absolute value belongs to the set {-1,0,1} . 

b. A "zero flag" for each coefficient of a, which indicates if the coefficient 
is equal to zero or not. 

c. The sign of all the coefficients of a, whose "zero flag" is false. 

d. The LSB of the HL-subband and HH-subband coefficients that belong to 
higher bit plane. 

Remark: 

Since the LH-subband contains only even coefficients, their LSB must be 
zero and is not coded. 

3. For tJbitPlane =0, which we call the "half bit plane", a matrix of 

( tilcLcnsth ^ 

— bits associated with HH-subband coefficients as their last "half 

I 2 J 

bit" (See (3.24) or (3.30)). 
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5 5. THE PROGRESSIVE SUBBAND CODING ALGORITHM 
5.1 The encoding algorithm 



The encoding algorithm of the present invention is performed at the server 120. In 
the present imaging system this rather time consuming task is performed locally in near 
10 real-time for a ROI, and not on the full image. The encoding algorithm is described for 
images with a single color component, such as grayscale images, but of course may 
also be applied to images with multiple color components. The straightforward 
generalization for an arbitrary number of components will be explained later. 
The lossless algorithm receive as input the following parameters: 

15 



Variable 


Meaning 


coef 


Matrix of subband coefficients, containing 

_ ( tileLength^f _ . 

3x coefficients 

I 2 ) 


HalfBit 


Matrix of bits containing ^ e ^ en S^\ 

v 2 J 



Table 4 - Lossless Encoding Algorithm Input Parameters 

The coding strategy is similar in some sense to that described in A. Said and W. 
20 Pearlman, "A new, fast and efficient image codec based on set partitioning", IEEE 

Trans. Circuits and Systems for video Tech., Vol. 6, No. 3, pp. 243-250, 1996, but the 
preferred embodiment uses no "Zero Tree" data. For all the data blocks with t bitPlane 
>2, we use the lossy encoding algorithm described in previous art with the parameters: 

• coef := coef (The lossy parameter coef initialized with the lossless parameter 
25 coef) 

• equalBinSize := True 



28 



ATTORNEY DOCKET NO, 181 04.001 1U1 



Remark: The lossy algorithm encodes all the bit-plane information for 
t _bitPlane>2 . 

For t_bitPlane< 1, i.e. the least significant bit plane (of the lossless algorithm) and the 
half bit plane, we use a different algorithm described in 5.1.3. 

5.1.1 Encoding algorithm initialization 

The lossless encoding algorithm initialization is the same as the lossy algorithm 
of § 4.1.1 in the above-cited Ser. No. 09/386,264, which disclosure is incorporated 
herein by reference. In order to initialize the encoding algorithm, the following 
procedure is performed: 

1 . Assign to each coefficient coef (x, y) its bit plane b (x, y) such that: 

\coef(x,y)\^[s c 2\s c 2 b+l ) 

2. Compute the maximum bit plane over all such coefficients: 

maxBitPlane (tile) = max (b (x, y )) 

3. Write the value of maxBitPlane (tile) using one byte as the header of 

the data block: 

(t _x,t _y,t _ resolution, maxBitPlane (t _ resolution)) 

4. Initialize all the coefficients as members of their corresponding Typel6 
group. 

5. Initialize a list of significant coefficients to be empty. 
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6. Initialize a coefficient approximation matrix coef as zero. 

5.1.2 The outer loop 

The outer loop of the encoding algorithm scans the bit planes from 
b = maxBUPlane(tile) to b = 0 . The output of each such bit plane scan is the subband 

data block. Since the last stage of the encoding algorithm is arithmetic encoding of 
given symbols, at the beginning of each scan the arithmetic encoding output module is 
redirected to the storage area allocated for the particular data block. Once the bit plane 
scan is finished and the data block has been encoded, the output stream is closed and 
the bit plane b is decremented. After the outer loop is finished the following stages are 
performed: 

1 . Least significant bit plane is encoded (t_bitPlane =1). 

2. Half bit plane is encoded (t_bitPlane = 0). 

The output of the least significant bit plane scan is the data block (Figure 14): 

(t_x, t_y, t_resolution, t_bitPlane=l). 

The half bit plane data block is: 

(t_x, t_y, t_resolution ? t_bitPlane=0). 

5.1.3 Bit-plane scan 

For t_bitPlane > 2, the framework of the bit plane scan is described in Figure 
24, while the pseudo code is given in the above-cited Ser. No. 09/386,264, which 
disclosure is incorporated herein by reference. The scan, for a given level b (b>2) 9 
encodes all of the coefficients' data corresponding to the absolute value interval 

[2 b ~\2 b ) . 

Remark: The encoder method isLastBitPlane() is associated to the t bitPlane = 2. 
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For the least significant bit plane, a pseudo code is described in Figure 15, while a flow 
chart is described in Figure 22. 

Explanations to the least significant bit encoding algorithm: 

1 . The coefficients scanning procedure, i.e. moreCoef() procedure in Figure 15 or 
"More coefficients?" in Figure 22 includes all the coefficients belong to the HH and 
the HL-subband (Figure 14). The LH-subband is skipped, since the least significant 
bit of each coefficient in it is zero (see remark 2 for (3.22)). 

2. The procedure isCoefReporeted() ("Is coefficient reported?" in the flow chart) 
returns false if the coefficient is one of {-1,0,1} , i.e. in all higher bit plane scans it 

was insignificant, otherwise it returns true. 

3. The procedure isCoeffixactZero() ("Coefficient is zero?" in the flow chart) returns 
true iff the coefficient is zero. 

4. The procedure getCoefSign() returns the coefficient's sign. 

For the half bit plane, a pseudo code is described in Figure 16. 

5.2 The decoding algorithm 

Obviously, this algorithm is a reversed step of the encoding algorithm of section 
5.1, performed in the server 120. The client computer 1 10 during the progressive 
rendering operation performs the decoding algorithm. Similar to the encoding 
algorithm, the decoding algorithm is described for an image with one component (such 
as a grayscale image), but of course could also be used with an image with more than 
one component. The input parameters to the lossless algorithm are given below: 
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Variable 


Meaning 


coef 


Empty matrix of subband coefficients to be filled by the decoding algorithm. 


Hal/Bit 


Matrix of bits containing ^ e ^ en S^ ^its. 

v 2 J 



Table 5 - Lossless decoding algorithm input parameters 

For all the data blocks with tbitPlane >2, a "lossy" decoding algorithm is utilized. The 
input parameters for the lossy algorithm are: 

• coef \—coef 

• equalBinSize := True 

• s c :=2 

5.2.1 Decoding algorithm initialization 

L Assign to each coefficient Coef (x,y) the value zero. 

2. Assign to each bit belongs to the HalfBit matrix the value zero. 
3.1nitialize all the coefficients as members of their corresponding Typel6 group. 
4.1nitialize the list of significant coefficients to be empty. 
5Jfthe "first" data block 

{t _ xj _ y,t _ resolution, maxBitPlane (t __ resolution^ 
is available at the client, read the first byte, which is the value of maxBitPlane{tile) . 

5.2.2 The outer loop 

Upon the completion of the outer loop in 5.1.2the following stages are preformed: 

1 . The decoding algorithm scans the least significant bit plane. The input to this 
stage is encoded data block (t_x, t_y, t_resolution ? LeastSiginificant_bitPlane). 

2. The decoding algorithm scans the half bit plane. The input to this stage is 
encoded data block (t_x ? t_y, tresolution, HalfbitPlane). 
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5.2.3 Bit plane scan 

The preferred embodiment follows the lossy prior art of the above-cited Ser. No. 
09/386,264, which disclosure is incorporated herein by reference, for t_bitPlane>2. 
The scan, for a given level b , decodes all of the coefficients' data corresponding to the 
absolute value interval [s2 b , s2 b+l ) . Pseudo codes of the least significant bit plane scan 

and half bit plane scan are described in Figure 16. 
Explanations to the pseudo code: 

1 . The decoder's method moreCoefO scans all the coefficients in the HH, HL and LH 
subband. But, since the LH-subband is skipped in the encoding algorithm, we don't 
call to decodeSymbolO for its coefficients. Instead of this, we update their least 
significant bit as zero. 

2. Recall that LH-subband coefficients that have not been reported until the least 
significant bit-plane must be zero since they are known to be even. 

6. CLIENT WORKFLOW 

With reference to Figure 4, we describe the workflow at the client unit 110. Any 
new ROI generated by the user's action such as a zoom-in, a scroll, or a luminance 
tuning invokes in step 401 a call from the GUI interface to the client imaging module 
with the new ROI view parameters. The client imaging module then computes in step 
402 which data blocks are required for the rendering of the ROI and checks if these 
data blocks are available in the client cache. If not, their coordinate is added to a 
request list ordered according to some progressive mode. The request list is then 
encoded in step 403 and sent to the server. The server responds to the request by 
sending back to the client a stream of data blocks, in the order in which they were 
requested. In step 404 the client inserts them to their appropriate location in the 
distributed database. At various points in time step 405, a rendering of the ROI, is 
invoked. Naturally, the rendering operation is progressive and is able to use only the 
currently available data in the client's database. 
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6.1 Step 401:Receiving the ROI Parameters 

The imaging module on the client computer 120 receives from the GUI 
interface module view parameters detailed in Table 5. These parameters are used to 
generate a request list for the ROI. The same parameters are used to render the ROL 



Table 5 



Variable 


Meaning 


worldPolygon 


A 2D polygon in the original image coordinate system 


scale 


The view resolution. 0 < scale < 1 implies a low view resolution, 
scale = 1 original resolution and scale > 1 a higher than original view 
resolution 


deviceDepth 


A number in the set {8,16,24} representing the depth of the output 
device (screen, printer) 


viewQuality 


A quality factor in the range [l, 7] where 1 implies very low quality and 
7 implies lossless quality 


luminanceMap 


If active: a curve defining a mapping of medical images with more than 
8 bits (typically 10,12,16 bits) per grayscale values to an 8 bit screen 


progressiveMo 


One of: Progressive By Accuracy, Progressive By Resolution, 
Progressive by Spatial Order 



The basic parameters of a ROI are worldPolygon and scale which determine 
uniquely the ROI view. If the ROI is to be rendered onto a viewing device with limited 
resolution, then a worldPolygon containing a large portion of the image will be coupled 
by a small scale . In the case where the rendering is done by a printer, the ROI could 
be a strip of a proof resolution of the original image that has arrived from the server 
computer 120. This strip is rendered in parallel to the transmission, such that the 
printing process will terminate with the end of transmission. The other view 
parameters determine the way in which the view will be rendered. The parameters 
deviceDepth and viewQuality determine the quality of the rendering operation. In 
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cases the viewing device is of low resolution or the user sets the quality parameter to a 
lower quality, the transfer size can be reduced significantly. 

The parameter luminanceMap is typically used in medical imaging for 
grayscale images that are of higher resolution than the viewing device. Typically, 
screens display grayscale images using 8 bits, while medical images sometimes 
represent each pixel using 16 bits. Thus, it is necessary to map the bigger pixel range 
to the smaller range of [0,255] . 

Lastly, the parameter progressiveMode determines the order in which data 
blocks should be transmitted from the server 120. The "Progressive By Accuracy" 
mode is the best mode for viewing in low bandwidth environments. "Progressive By 
Resolution" mode is easier to implement since it does not require the more 
sophisticated accuracy (bit plane) management and therefore is commonly found in 
other systems. The superiority of the "progressive by accuracy" mode can be 
mathematically proven by showing the superiority of "non-linear approximation" over 
"linear approximation" for the class of real-life images. See, e.g., R. A. DeVore, 
"Nonlinear approximation", Acta Numerica, pp. 51-150, 1998. 

The "Progressive by Spatial Order" mode is designed, for example, for a "print 
on demand" feature where the ROI is actually a low resolution "proof print" of a high 
resolution graphic art work. In this mode the image data is ordered and received in a 
top to bottom order, such that printing can be done in parallel to the transmission. 

However, since lossless compression is mostly required in medical images 
transmission, where typically more than 8 bits images are used, we amplify our 
discussion on the curve {luminanceMap hereinabove) which defines the mapping from 
the original image gray scale range (typically 10,12,16 bits) to an 8-bit screen. Further 
more, in medical images viewing, regardless of the original image depth, mapping is 
required in order to control the brightness and contrast of the image. 

6.1.1 Luminance mapping 

Mapping from original image depth (e.g. 10,12,16 bits ) to screen depth 
(typically 8-bits), is defined by a monotonic function (Figure 17): 
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y . j~Q ^riginal _image depth — ^ £() ~ depth — 1 J (6 1) 

The curve influences not only the mapping, i.e. the drawing to the screen, but also the 
request from the server. To understand that, let us concentrate in the maximal gradient 
of the curve (Figure 17). In a lossy mode, the request is created such that the image 
approximation in the client side is close enough to the original image, i.e., the RMS 
(Root Mean Square Error) is visually negligible. When a curve (mapping function) is 
applied, the RMS can be increased or reduced. The maximal RMS increasing factor 
depends on the maximal gradient of the curve as follows: 

RMs(f{l)j(l)) 

RMS _ increasing_factor = i 7 — ^^-U- < max (/ ') , (6.2) 

RMS[lJ) 

where 

• I is the original image 

• I is the approximated image 

• /is the mapping function 



• RMS(I 1 J 2 ) = - 

Image_size 

• max(/') is the maximal gradient of the curve. 

We consider the worst case of the RMS increasing factor i.e.: 

RMS _ increasing_factor = Maximal _ gradient = max (/ f ) 

If the RMS increasing factor is greater than 1, it means that the "new RMS" may be 
greater than we consider as visually negligible error. Thus, the request list should be 
increase (more bit-planes should be requested from the server) in order to improve the 
approximation accuracy. Conversely, if the RMS increasing factor is smaller than 1, the 
request listing can be reduced. The exact specification of this is given in the following 
section. 



6.2 Step 402: Creating the request list 

In step 402 using the ROI view parameters, the client imaging module at the 
client computer 110 calculates data blocks request list ordered according to the 
progressiveMode . Given the parameters worldPolygon and Scale , it may be 
determined which subband tiles in the "frequency domain" participate in the 
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reconstruction of the ROI in the "time domain". These tiles contain all the coefficients 
that are required for an "Inverse Subband/Wavelet Transform" (IWT) step that 
produces the ROL First, the parameter dyadicResolution{ROI) is computed, which is 
the lowest possible dyadic resolution higher than the resolution of the ROL Any 
subband tiles of a higher resolution than dyadicResolution {ROl) do not participate in 

the rendering operation. Their associated data blocks are therefore not requested, since 
they are visually insignificant for the rendering of the ROL If scale > 1 , then the 
highest resolution subband tiles are required. If scale < 2 l - numberQfResolutions then only the 
lowest resolution tile is required. For any other value of scale we perform the 
mapping described below in Table 6. 



Table 6 



scale 


highestSubbandResolution 


scale ^ 2}~ number< ^ Reso ^ ut * ons 


1 


^1-numberOfResolutions ^ SCale ^ 1 


numberOfResolutions - \ — log 2 {scale) J 


scale > 1 


numberOfResolutions 



Once it has been determined which subband tiles participate in the rendering of 
the ROI, it is necessary to find which of their data blocks are visually significant and in 
what order they should be requested. Using well known rate/distortion rules from the 
field of image coding (such as is described in S. Mallat and F. Falzon, "Understanding 
image transform codes", Proc. SPIE Aerospace Conf., 1997), it is not too difficult to 
determine an optimal order in which the data blocks should be ordered by the client 
imaging module (and thus delivered by the server 120). This optimal order is described 
in steps 301-310 of Figure 3 for the "Progressive By Accuracy" mode. The underlying 
mathematical principal behind this approach is "Non-Linear Approximation". 

First, the subband coefficients with largest absolute values are requested since 
they represent the most visually significant data such as strong edges in the image. 
Notice that high resolution coefficients with large absolute value are requested before 
low resolution coefficients with smaller absolute value. Within each given layer of 
precision (bit plane) the order of request is according to resolution; low resolution 
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coefficients are requested first and the coefficients of MghestSubbandResolution are 
requested last. 

The main difficulty of this step is this: Assume a subband tile is required for the 
rendering of the ROL This means that t _ resolution < dyadicResolution{ROl) and the 

tile is required in the IWT procedure that reconstructs the ROL It must be understood 
which of the data blocks associated with the subband tile represent visually 
insignificant data and thus should not be requested. Sending all of the associated data 
blocks will not affect the quality of the progressive rendering. However, in many cases 
transmitting the "tail" of data blocks associated with high precision is unnecessary 
since it will be visually insignificant. In such a case, the user will see that the 
transmission of the ROI from the server 120 is still in progress, yet the progressive 
rendering of the ROI seems to no longer to change the displayed image. 

Additionally, the influence of the luminance mapping on the accuracy level of 
the requested data block as described below. Supposing for some t _ x 9 1 _ y and 
t resolution , the set 

{(t_x,t_y 9 t__ resolution, t _ bitPlane)\ T <t _ bitPlane < maxPlaneBit (t _ resolution) 

is requested where T is the minimal bit plane required to the current view. Here, where 
the luminance mapping is taken in account, the value of T might be increased or 
decreased. 

The number of bit planes reduced (added) from the request list is 

log f ! 

^ Maximal _ gradient 
I.e., for those t_x 9 t_y and t _ resolution mentioned before, the following set is 
requested: 

{(t_xj_y,t_ resolution, t _ bitPlane)\ T'<t_ bitPlane < maxPlaneBit (t _ resolution)} 
where 



r ! =r+ 

Examples: 



log/ 1 



Maximal _ gradient 
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1. Given 

• Image depth of 12-bit 

• Screen depth of 8-bit 

2 8 

• Linear luminance mapping, I.e., Maximal gradient =— = 2 



The number of bit planes reduced from the request list is: 

( 1 > 

log 2 



Maximal gradient 
2. Given a luminance mapping with Maximal gradient =2 
The number of bit planes reduced from the request list is 

log 2 

^ Maximal gradient 

Thus one bit plane is added to the original set. 



log 2 



2~ 4 



= 4. 



2y 



= -1. 



6,3 Step 403: Encoding the request list 

The client imaging module in the client computer 1 10 encodes the request list 
into a request stream that is sent to the server computer 120 via the communication 
network 130 (Figure 1). The request list encoding algorithm is a simple rectangle-based 
procedure. The heuristics of the algorithm is that the requested data block usually can 
be grouped into data block rectangles. From the request list of data blocks indexed by 
the encoding algorithm computes structures of the type 

{(t _xj _y,t _resolution,t _bitPlane),n x ,n y } , n x ,n y >1 

(1.3) 

Each such structure represents the n x n data blocks 

x y 

{{t _x + i,t _y + j,t _ resolution, t _ bitPlane)} , 0 < i < n x , 0 < j < n y 
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The encoding algorithm attempts to create the shortest possible list of structures, 
collecting the data blocks to the largest possible rectangles can do this. It is important 
to note that the algorithm ensures the order of data blocks in the request list is not 
changed, since the server 120 will respond to the request stream by transmitting data 
blocks in the order in which they were requested. A good example of when this works 
well is when a user zooms in into a ROI at a high resolution that was never viewed 
before. In such a case the request list might be composed of hundreds of requested data 
blocks, but they will be collected to one (x,y) rectangle for each pair 

(t _ resolution, t _ bitPlane) . 

6.4 Step 404: Receiving the data blocks 

The client computer 110 upon receiving from the server computer 120 an 
encoded stream containing data blocks, decodes the stream and inserts the data blocks 
into their appropriate location in the distributed database using their ID as a key. The 
simple decoding algorithm performed here is a reversed step of the encoding infra. 
Since the client 1 10 is aware of the order of the data blocks in the encoded stream, only 
the size of each data block need be reported along with the actual data. In case the 
server 120 informs of an empty data block, the receiving module marks the appropriate 
slot in the database as existing but empty. 

Recall that the subband tile associated with each data block is denoted by the 
first three coordinates of the four coordinates of a data block (t_x,t_y,t_ resolution) . 
From the subband tile's coordinates the dimensions are calculated of the area of visual 
significance; that is, the portion of the ROI that is affected by the subband tile. Assume 
that each subband tile is of length tileLength and that the wavelet basis used has a 

maximal filter size maxFilterSize , then defining hFilterSize := \maxFilterSize/l\ and 
factor \= numberOfResolutions - 1 _ resolution + 1 , we have that the dimensions of the 
affected region of the ROI (in the original image's coordinate system) are 
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[t _ x x tileLength factor - hFilterSize factor , (t _ x + 1) x tileLength factor + hFilterSize factor 
[t_yx tileLength factor - hFilterSize factor 9 (t_y + l)x tileLength factor + hFilterSize factor 

These dimensions are merged into the next rendering operation's region. The 
rendering region is used to efficiently render only the updated portion of the ROL 

6.5 Progressive Rendering 

During the transmission of ROI data from the server to the client, the client 
performs rendering operations of the ROL To ensure that these rendering tasks do not 
interrupt the transfer, the client runs two program threads: communications and 
rendering. The rendering thread runs in the background and draws into a pre-allocated 
"off-screen" buffer. Only then does the client use device and system dependant tools to 
output the visual information from the "off-screen" to the rendering device such as the 
screen or printer. 

The rendering algorithm performs reconstruction of the ROI at the highest 
possible quality based on the available data at the client. That is, data that was 
previously cached or data that "just" arrived from the server. For efficiency, the 
progressive rendering is performed only for the portion of the ROI that is affected by 
newly arrived data. Specifically, data that arrived after the previous rendering task 
began. This "updated region" is obtained using the method of step 404 described in 
§6.4. 

The parameters of the rendering algorithm are composed of two sets: 

1. The ROI parameters described in Table 3. 

2. The parameters transmitted from the server explained in Table 5 , with the 
exception of the jumpSize parameter, which is a "server only" parameter. 

The rendering algorithm computes pixels at the dyadic resolution 
dyadicRresolution(ROI). Recall that this is the lowest possible dyadic resolution that is 
higher than the resolution of the ROL The obtained image is then resized to the correct 
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resolution. Using a tiling of the multiresolution representation of the ROI, the steps of 
the algorithm are performed on a tile by tile basis as described in Figure 1 1 . Since the 
tiles' length are tileLength, which is typically chosen as 64, the rendering algorithm is 
memory efficient. 

6.5.1 The rendering rate 

As ROI data is transmitted to the client 110, the rendering algorithm is 
performed at certain time intervals of a few seconds. At each point in time, only one 
rendering task is performed for any given displayed image. To ensure that progressive 
rendering does not become a bottleneck, two rates are measured: the data block transfer 
rate and the ROI rendering speed. If it predicted that the transfer will be finished 
before a rendering task, a small delay is inserted, such that rendering will be performed 
after all the data arrives. Therefore, in a slow network scenario (as the Internet often 
is), for almost the entire progressive rendering tasks, no delay is inserted. With the 
arrival of every few kilobytes of data, containing the information of a few data blocks, 
a rendering task visualizes the ROI at the best possible quality. In such a case the user 
is aware that the bottleneck of the ROI rendering is the slow network and has the option 
to accept the current rendering as a good enough approximation of the image and not 
wait for all the data to arrive. 

6.5.2 Memory constraint subband data structure 

This data-structure is required to efficiently store subband coefficients, in 
memory, during the rendering algorithm. This is required since the coefficients are 
represented in long integer (lossless coding mode) or floating-point (lossy coding 
mode) precision which typically require more memory than pixel representation (1 
byte). In lossy mode, the coefficients at the client side 1 10 are represented using 
floating-point representation, even if they were computed at the server side 120 using 
an integer implementation. This will minimize round-off errors. 

At the beginning of the rendering algorithm, coefficient and pixel memory strips 
are initialized, dyadic Width (ROI) may be denoted as the width of the projection of the 
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ROI onto the resolution dyadicResolution (ROI) . For each component and resolution 
1 < j < dyadicRe solution (ROI) , four subband strips are allocated for the four types of 
subband coefficients: hl,lh,hh and HalfBit . The coefficient strips are allocated with 



dimensions 



2 j- d ya d icRe S oluHon(ROl)-X ^ Jy^^h (tf 0j) , - X tileLength 



maxFilterSize 

+ 



2 2 



For each component and resolution 1 < j < dyadicResolution a pixel strip is 
allocated with dimensions 

.j-^icResoiunoniRoi) x ^icWidth (ROl) , tileLength + maxFilterSize 



Beginning with the lowest resolution 1, the algorithm proceeds with a recursive 
multiresolution march from the top of the ROI to bottom (y direction. Referring to 
Figures 10 and 1 1, in step 1 101, the multiresolution strips are filled with sub-tiles of 
coefficients 1050 decoded from the database or read from the memory cache. From the 
coefficients we obtain multiresolution pixels 1051 using an inverse subband transform 
step 1 102 (shown in further detail in Figure 10). Each time a tile of pixels at 
resolutions j < dyadicResolution (ROI) is reconstructed, it is written into the pixel strip 
at the resolution j . Each time a tile of pixels at the highest resolution 
dyadicResolution (ROI) is reconstructed, it is fed into the inverse color transform and 
resizing steps 1103, 1104. 



6.53 Step 1101: decoding and memory caching 

The subband coefficients data structure described previously in section 6.5.2 is 
filled on a tile basis. Each such subband tile is obtained by decoding the corresponding 
data blocks stored in the database or reading from the memory cache. The memory 
cache is used to store coefficients in a simple encoded format. The motivation is this: 
the decoding algorithm described previously in section 5.2 is computationally intensive 
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and thus should be avoided whenever possible. To this end the rendering module uses 
a memory cache 111 where subband coefficient are stored in very simple encoded 
format which decodes very fast. For each required subband tile, the following 
extraction procedure is performed, described in Figure 25, beginning at step2501. In 
step2502, if no data blocks are available in the database for the subband tile, its 
coefficients are set to zero (step2503). In step2504, if the tile's memory cache storage 
is updated, namely it stores coefficients in the same precision as in the database, then 
the coefficients can be efficiently read from there (step2505). In step2506, the last 
possibility is that the database holds the subband tile in higher precision. Then, the tile 
is decoded down to the lowest available bit plane using the algorithm previously 
described in section 5.2 and the cached representation is replaced with a representation 
of this higher precision information. 

6.5.4 Step 1102: inverse lossless wavelet transform 

This is an inverse step to step 603 performed in the server (see 7.1.5). Following Figure 

21 we see that four "extended" subband coefficient sub-tiles of length 

tileLengthjl + maxFilterSize at the resolution / are read from the coefficient strips data 

structure and transformed to a tile of pixels at the next higher resolution using 

lossless WaveletdTransformType{i) . If i + 1 < dyadicResolution(ROl) , the tile of pixels 

obtained by this step is inserted into the pixel memory strip at the resolution i + 1 . If 

i + 1 = dyadicResolution(ROl) the tile of pixels is processed by the next step of color 

transform. 

Remark: Recall from 5.1.1 that the "half bits" are initialized as zeros, therefore the 
inverse step is well defined even if their "real" value is not available in the client yet. 

6.5.5 Step 1103: inverse color transform 

This is an inverse step to step 603 performed at the server 120. It is performed 
only for tiles of pixels at the resolution MghestSubbandResolution . At this stage, all of 
the pixels of each such tile are in the outputColorSpace and so need to be transformed 
into a displaying or printing color space. For example, if the original image at the 
server 120 is a color image in the color space RGB, the pixels obtained by the previous 
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Ml 



5 step of inverse subband transform are in the compression color space YUV. To convert 
back from YUV to RGB, we use the inverse step described in Figure 13. If the ROI is 
at an exact dyadic resolution, then the tile of pixels in the rendering color space is 
written into the off-screen buffer. Else it is resized in the next step. 

10 

6.5.6 Step 1104: image resize 

In case the resolution of the ROI is not an exact dyadic resolution, the image 
obtained by the previous step must be re-sized to this resolution. This can be 
accomplished using operating system imaging functionality. In most cases the 

15 operating system's implementation is sub-sampling which produces in many cases an 
aliasing effect which is visually not pleasing. To provide higher visual quality, the 
imaging system of the present invention may use the method of linear interpolation, for 
example described in J. Proakis and D. Manolakis, "Digital signal processing", Prentice 
Hall, 1996. The output of the interpolation is written to the off-screen buffer. From 

20 there it is displayed on the screen using system device dependant methods. 

6.5.7 Step 1105: mapping to 8-bit screen 

When luminanceMap is active mapping to 8-bit screen is performed using the 
mapping function described in 6.1.1. 

25 

7. SERVER WORFLOW 

With reference to Figure 5, the operation of the server computer 120 (Figure 1) 
will now be described. Initially, an uncompressed digital image is stored in, for 
example, storage 122 of the server computer 120. This uncompressed digital image 
30 may be a two-dimensional image, stored with a selected resolution and depth. For 
example, in the medical field, the uncompressed digital image may be contained in a 
DICOM file. 

Once the client computer 110 requests to view or print a certain image, the 
server performs the preprocessing step 501. This step is a computation done on data 
35 read from the original digital image. The results of the computation are stored in the 
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server cache device 121. After this fast computation a "ready to serve" message is sent 
from the server to the client containing basic information on the image. 

In step 502, the server receives an encoded stream of requests for data blocks 
associated with a ROI that needs to be rendered at the client. The server then decodes 
the request stream and extracts the request list. 

In step 503, the server reads from cache or encodes data block associated with 
low resolution portions of the ROI, using the cached result of the preprocessing stage 
501. 

If the ROI is a high-resolution portion of the image, the server, in step 504, 
reads from cache or performs a "local" and efficient version of the preprocessing step 
501. Specifically, a local portion of the uncompressed image, associated with the ROI, 
is read from the storage 122, processed and encoded. In step 505, the data that was 
encoded in steps 503-504 is progressively sent to the client in the order it was 
requested. 

7.1 Step 501: preprocessing 

The preprocessing step is now described with respect to Figure 6. The 
preprocessing algorithm's goal is to provide the fastest response to the user's request to 
interact with the image. Once this fast computational step is performed, the server is 
able to provide efficient "pixel-on-demand" transmission of any client ROI requests 
that will follow. In most cases the first ROI is a view of the full image at the highest 
resolution that "fits" the viewing device. The preprocessing algorithm begins with a 
request for an uncompressed image that has not been processed before or has been 
processed but the result of this previous computation has been deleted from the cache. 
As explained, this unique algorithm replaces the possibly simpler procedure of 
encoding the full image into some progressive format. This latter technique will 
provide a much slower response to the user's initial request then the technique 
described below. At the end of the algorithm a "ready to serve ROI of the image" 
message is sent to the client containing basic information on the image. While some of 
this information, image dimensions, original color space, resolution etc., is available to 
the user of the client computer, most of this information is "internal" and required by 
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the client to formulate ROI request lists (§6.2) and progressively render (§6.5). Next we 
describe in detail the preprocessing algorithm. 



7.1.1 Preprocessing Parameters 
Table 7 



Variable 


Meaning 


losslessMode 


If true, preprocessing will prepare data that can be used for 
lossless transmission. 


subbandTransformTyp 


The framework allows the ability to select a different subband 
transform for each resolution of each image. The technical term 
is: non-stationary transforms. 


numberOfResolutions 


The number of resolutions in the Multiresolution structure 
odio uidxcu ior uie image. 

Typically, numberOfResolutions = log 2 {^j ImageSize^ . 


jumpSize 


A number in the range [0, numberOfResolutions - 1] . The 

preprocessing stage computes only the top lower part of the 
image's multiresolution pyramid of the 
size numberOfResolutions — jumpSize. 


tileLength 


Typically = 64. Tradeoff between time and coding performance. 


nTilesX(j) 


Number of subband tiles in the x direction at the resolution j 


nTilesX(j) 


Number of subband tiles in the y direction at the resolution j 


inputColorSpace 


Color space of uncompressed original image. 


outputColorSpace 


Transmitted color space used as part of the encoding technique. 


numberOfComponents 


Number of components in OutputColorSpace . 


threshold (c,j) 


A matrix of values used in lossy compression. The subband 
coefficients of the component c at the resolution j with absolute 

value below threshold (c,j) are considered as (visually) 

insignificant and set to zero. 



Given an input image, the parameters described in Table 7 are computed or 
chosen. These parameters are also written into a header sent to the client 110 and are 
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used during the progressive rendering step 405 (see section 6.5, described previously). 
The important parameters to select are: 

1 . losslessMode : In this mode, progressive transmission of images takes 
place until lossless quality is obtained. Choosing this mode requires the 
preprocessing algorithm to use certain reversible wavelet transforms, 
and can slow down the algorithm. 

2. subbandTransformType(j) : The (dynamic) selection of wavelet basis 

(as described, for example, in I. Daubechies, "Ten lectures on wavelets", 
Siam, 1992) is crucial to the performance of the imaging system. The 
selection can be non-stationary, meaning a different transform for each 
resolution j . The selection is derived from the following 
considerations: 

a. Coding performance (in a rate/distortion sense): This is 
obviously required from any decent subband/wavelet transform. 

b. Approximation of ideal low pass: It is favorable to choose a 
transform such that low resolutions of the image will be of high 
visual quality (some filters produce poor quality low resolutions 
even before any compression takes place). 

c. Fast transform implementation: Can the associated fast transform 
be implemented using lifting steps (as described, for example, by 
L Daubechies and W. Sweldens, "Factoring wavelet transforms 
into lifting steps", J. Fourier Anal. Appl., Vol. 4, No. 3, pp. 247- 
269, 1998), using only integer shifts and additions, etc. Some 
good examples are the Haar and CDF transforms (1,3), (2,2)*** 
described in I. Daubechies, "Ten lectures on wavelets", Siam, 
1992. 

d. Fast low pass implementation: A very important parameter, 
since together with the parameter jumpSize , it determines 
almost all of the complexity of the algorithm. For example, the 
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CDF (1,3) is in this respect the "optimal" transform with three 
vanishing moments. Since the dual scaling function is the simple 
B-spline of order 1, its low pass is simple averaging. Thus, the 
sequence of CDF transforms, using the B-spline of order 1 as the 
dual scaling function, but with wavelets with increasing number 
of vanishing moments are in some sense optimal in the present 
system. They provide a framework for both real time response 
and good coding efficiency. 

Lossless mode: If losslessMode is true we must choose the 
filters from a subclass of reversible transforms (see, for example, 
"Wavelet transforms that map integers to integers", A. 
Calderbank, I. Daubechies, W. Sweldens, B. L. Yeo, J. Fourier 
Anal. Appl., 1998). 

Low system I/O: If the network 130 in Figure 1 connecting 
between the Image residing on the storage 122 and the imaging 
server 120 is slow, the bottleneck of the preprocessing stage (and 
the whole imaging system for that fact) might be simply the 
reading of the original image. In such a case a transform may be 
chosen with a lazy sub-sampling low pass filter that corresponds 
to efficient selective reading of the input image. Many 
interpolating subband transforms with increasing number of 
vanishing moments can be selected to suit this requirement. 
However, this choice should be avoided whenever possible, since 
it conflicts with (a) and (b). 

Image type: If the type of the image is known in advance, an 
appropriate transform can be chosen to increase coding 
efficiency. For example: Haar wavelet for graphic images, 
smoother wavelet for real-life images, etc. In the graphic arts 
field, there are numerous cases of documents composed of low 
resolution real-life images and high resolution graphic content. 
In such a case, a non-stationary sequence of transforms may be 
chosen: Haar for the high resolutions and a smoother basis 
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starting at the highest resolution of a real-life image part. In case 
of low system I/O (f), a non-stationary choice of interpolating 
transforms of different orders is required. 

3. jumpSize : This parameter controls the tradeoff between fast response 
to the user's initial request to interact with the image and response times 
to subsequent ROI requests. When jumpSize is large, the initial 
response is faster, but each computation of a region of interest with 
higher resolution than the jump might require processing of a large 
portion of the original image. 

4. InputColorSpace: The input color spaces supported in lossless mode are: 

a. Grayscale: For grayscale images 

b. RGB 

5. outputColorSpace : The following are color spaces which perform well 
in coding: 

a. Grayscale: For grayscale images 

b. YUV: for viewing color images 

Referring to Table 7 [LP], losslessMode is set to true. Threshold (cj) is not in 

use, since in lossless mode, there is no thresholding. The rest of the variables have the 
same meaning as in the lossy algorithm. 

7.1.2 Memory constraint multiresolution scan data structure 

Most wavelet coding algorithms have not addressed the problem of memory 
complexity. Usually the authors have assumed there is sufficient memory such that the 
image can be transformed in memory from the time domain to a wavelet frequency 
domain representation. It seems the upcoming JPEG2000 will address this issue, as did 
its predecessor JPEG. The preprocessing algorithm also requires performing subband 
transforms on large images, although not always on the full image, and thus requires 
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careful memory management. This means that the memory usage of the algorithm is 
not of the order of magnitude of the original image, as described in J. M. Shapiro/'An 
embedded hierarchical image coder using zero-trees of wavelet coefficients", IEEE 
Trans. Sig. Proa, Vol. 41, No. 12, pp. 3445-3462, 1993. 

Given an uncompressed image we allocate the following number of memory 

strips 

numberOfComponents x {numberOfResolutions - jumpSize) 

of sizes 

^2-(numberofResoiution S -j) j ma g e jp ie } t ^ 3 x tileLength / 2 + maxFilterSize J 
for 1< j < numberOfResolutions- jumpSize -1 and 

[imageWidth, tileLength + 2 x maxFilterSize] 
for j = numberOfResolutions - jumpSize 

That is, the memory usage is proportional to 2~ JumpShe x imageWidth . Each such 
strip stores low-pass coefficients in the color space outputColorSpace at various 
resolutions. 

Referring to Figure 6, during the preprocessing stage, the resolutions are 
scanned simultaneously from start to end in the y direction. For each color component 
and resolution, the corresponding strip stores low-pass coefficients or pixels at that 
resolution. The core of the preprocessing algorithm are steps 604-607, where tiles of 
pixels of length tileLength + 2x maxFilterSize are read from the memory strips and 
handled one at a time. In step 604 the tile is transformed into a tile of length tileLength 
containing two types of coefficient data: subband coefficients and pixels at a lower 
resolution. The subband coefficients are processed in steps 605-606 and are stored in 
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the cache. The lower resolution pixels are inserted in step 607 to a lower resolution 
memory strip. Whenever such a new sub-tile of lower resolution pixels is inserted into 
a strip, the algorithm's memory management module performs the following check: If 
the part of the sub-tile exceeds the current virtual boundaries of the strip, the 
corresponding first lines of the strip are considered unnecessary anymore. Their 
memory is (virtually) re-allocated for the purpose of storing the sub-tile's new data. 

7.1.3 Step 601: Lossless color-transform 

This step is uses the conversion formula described in Figure 13. This step must 
be performed before step 602, because the lossless color conversion is non-linear. 

7.1.4 Step 602: Lossless Wavelet Low Pass 

The motivation for the low pass step is explained in § 6.1.4 in the above-cited 
Ser. No. 09/386,264, which disclosure is incorporated herein by reference. In a lossless 
mode there are a few emphasis that are represented here. 

In step 602, the low pass filters of the 
transforms subbandTransformType{ j) , 

numberOfResolutions - jumpSize <j< numberOfResolutions , are used to obtain a low 
resolution strip at the resolution numberOfResolutions - jumpSize (as can be seen in 

Figure 26). Typically, it is required to low pass about 2 JumpSize lines of the original 
image 1010 to produce one line of the low resolution image. The low pass calculation 
is initiated by a read of tiles of pixels from the memory strips performed in step 604. 
Whenever there is an attempt to read missing low resolution lines, they are computed 
by low passing the original image and inserted into the memory strip. The insertion 
over-writes lines are no longer required, such that the algorithm is memory constrained. 
In the case where a non-linear color transform took place in the previous step 601, the 
results of that transform are low-passed. 

In the lossless mode of operation, the jumpSize parameter defines the number of 
lossless wavelet low pass steps should be done. A single low pass step is the same for 
Haar and CDF (1,3) and defined by the following two stages (taken from (3.20) and 
(3.22): 
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X-direction: s(n) 



x 



(2n) + x(2n+l) 
2 



Y-direction: s (n) = x(2n) + x(2n + 1) . 



Namely, in a 2D representation, the low pass step is defined by 




x (2m + 1, 2n + 1) + x (2m, 2n + 1) x(2m + l,2«) + jc(2m,2«) 



• (7.1) 



2 2 



For jumpSize = 1 and jumpSize = 2 (other sizes practically are not needed), the server 
performs these steps efficiently (almost like the lossy algorithm) by a single operation 
that simulates exactly jumpSize low pass steps defined in (7.1). As noticed from (7.1), 
the simplicity of the formula makes filters such as Haar and CDF (1,3) "optimal" in the 
respect of low pass efficiency. 

7.1.5 Step 603: Forward lossless Wavelet transform 

In Step 603 we perform one step of an efficient local lossless wavelet transform 
(§3), on a tile of pixels at the resolution \ <j< numberOfResolutions - jumpSize . The 

type of transform is determined by the parameter lossless WaveletTransformType(j) . 

As described in Figure 1, the transform is performed on an "extended" tile of pixels is 
of length tileLength + 2xmaxFilterSize (unless we are at the boundaries), read directly 
from a multi-resolution strip at the resolution j + \ . The output of the step is a lossless 
subband tile composed of wavelet coefficients including Half bit coefficients and low 
resolution coefficients/pixels. The transform step is efficiently implemented in integers 
as described in §3. 

The subband transform of step 603 outputs three types of data: scaling function 
(low pass), wavelet (high pass) and Halfbits. The wavelet coefficients are treated in 
step 604 while the scaling function coefficients are treated in step 605. 

Remark: Tiles of pixels which are located on the boundaries sometimes need to 
be padded by extra rows and/or columns of pixels, such that they will formulate a "full" 
tile of length tileLength . 
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7.1.6 Step 604: Variable Length encoding and storage 

In step 604, the subband coefficients that are calculated in step 603 are variable 
length encoded and stored in the cache 12 L If maxBitPlane(tile) = 0 we do not write 

any data. Else we loop on the coefficient groups {coef (2xi + x,2x j + y)} q ^ . For 
each such group we first write the group's variable length length (i 9 j) using 
log 2 {maxBitPlane{tile^ bits. Then for each coefficient in the group we write 
length(ij) + l bits representing the coefficient's value. The least significant bit 

represents the coefficient's sign: if it is 1 then the variable length encoded coefficient is 
assumed to be negative. The HalfBit subband coefficients are written in one-bit per 
coefficient. 

7.1.7 Step 605: Copying low pass coefficients into the mu I ti resolution strip 
structure 

In step 503, unless loss lessMode is true, the subband coefficients calculated in 
step 604 are quantized. This procedure is performed at this time for the following 
reason: It is required that the coefficients computed in the previous step will be stored 
in the cache 121. To avoid writing huge amounts of data to the cache, some 
compression is required. Thus, the quantization step serves as a preparation step for the 
next variable length encoding step. It is important to point out that the quantization 
step has no effect on compression results. Namely, the quantization step is 
synchronized with the encoding algorithm such that the results of the encoding 
algorithm of quantized and non-quantized coefficients are identical 

A tile of an image component c at the resolution j is quantized using the given 
threshold threshold {cj) : for each coefficients x , the quantized value is 
[x/ threshold (c, j)j . It is advantageous to choose the parameters threshold (c, j) to be 

dyadic such that the quantization can be implemented using integer shifts. The 
quantization procedure performed on a subband tile is as follows: 
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1 . Initialize maxBitPlane {tile) = 0 . 

2. Loop over each group of four coefficients 
coef (2 x i + x, 2 x j + y)} xy =0 { . For each such group initialize a variable 

length parameter length (z, j) = 0. 

3. Quantize each coefficient in the group coef (2x/ + x,2x j + y) using the 
appropriate threshold. 

4. For each coefficient, update length {ij) by the bit plane b of the 
coefficient, where the bit plane is defined by 

\coef (2 x i + x 9 2 x j + y)| e [2* threshold (c, y ) , 2* +1 threshold (c, j)) 

5. After processing the group of four coefficients, use the final value of 
length {i,j) to update maxBitPlane {tile) by 

maxBitPlane {tile) = max{maxBitPlane{tile)jength{ij)) 

6. At the end of the quantization step, store the value maxBitPlane {tile) in 
the cache 121. 

Note that for subband tiles located at the boundaries we can set to zero subband 
coefficients that are not associated with the actual image, but only with a padded 
portion. To do this we take into account the amount of padding and the parameter 
maxFilterSize . The motivation for the "removal" of these coefficients is coding 
efficiency. 
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7.2 Step 502: Decoding the request stream 

This is the inverse step of section 6.3. Once the request stream arrives at the 
server 120, it is decoded back to a data block request list. Each data structure the type 
representing a group of requested data blocks is converted to the sub-list of these data 
blocks. 

7.3 Step 503: Encoding low resolution part of ROI 

Step 503 is described in Figure 7. It is only performed whenever the data 
blocks associated with low-resolution subband tile are not available in the server cache 
121, 

Step 701 is the inverse step of step 604 described in §7.1.6. In the preprocessing 
algorithm subband tiles of lower resolution, that is resolutions lower than 
numberOfResolutions - jumpSize , were stored in the cache using a variable length type 
algorithm. For such a tile we first need to decode the variable length representation. 
The algorithm uses the stored value maxBitPlane {tile) . 

1. If maxBUPlane(tile) = 0 , then all the coefficients are set to zero including the 
HalfBit subband. 

2. If maxBitPlane(tile) = 1 , then all the coefficients are set to zero, and the HalfBit 
subband coefficient are read bit by bit from cache. 

3 . Else, as performed in 2, the HalfBit subband coefficient are read bit by bit from 
cache, and we perform the following simple decoding algorithm: 

For each group of four coefficients [coef (2 x i + x 9 2 x j + 0 1 > we read 
log 2 (maxBUPlane(tile)) bits representing the variable length of the group. 

Assume the variable length is length (ij) . For each of the four coefficients we 
then read length (/, j) + 1 bits. The least significant bit represents the sign. The 
reconstructed coefficient takes the value: 
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-1 readBits & 1 = 1 
1 readBits &1 = 0 

In step 702 we use the encoding algorithm described in §5.1 to encode the 
requested data blocks associated with the extracted subband tile. 

7.4 Step 504: Processing high resolution part of ROI 

Step 504 is described in Figure 8. In case we have used jumpSize > 0 in step 
501 and the resolution of the ROI > numberOfResolutions -jumpSize, we are sometimes 
required to perform a local variation of the preprocessing step described in §7.1. 
Whenever the server receives a request list of data blocks we check the following. If a 
data block has been previously computed (present in the cache 121) or is associated 
with a low resolution subband tile data block then it is either simply read from the 
cache or handled in step 503. Else, the coordinates of the data block are used to find the 
"minimal" portion of the ROI that needs to be processed. Then, a local version of the 
preprocessing algorithm is performed for this local portion. The difference here is that 
step 804 replaces Variable Length coding step 604 of the preprocessing algorithm by 
the encoding algorithm given in §5.1. 

7.5 Step 505: Progressive transmission of ROI 

In the final step, the encoded data tiles are sent from the server 120 to the client 
1 10, in the order they were requested. In many cases, data blocks will be empty. For 
example, for a region of the original image with a constant pixel value, all of the 
corresponding data blocks will be empty, except for the first one that will contain only 
one byte with the value zero representing maxBitPlane(tile) = 0. For a low activity 

region of the image, only the last data blocks representing higher accuracy will contain 
any data. Therefore, to avoid the extra side information, rectangles of empty data 
blocks are collected and reported to the client 1 10 under the restriction that they are 
reported in the order in which they were requested. For blocks containing actual data, 
only the data block's size in bytes need be reported, since the client 110 already knows 
which data blocks to expect and in which order. 



{readBits »l)x< 



57 



ATTORNEY DOCKET NO. 18104.0011U1 



The present invention has been described in only a few embodiments, and with 
respect to only a few applications (e.g., commercial printing and medical imaging). 
Those of ordinary skill in the art will recognize that the teachings of the present 
invention may be used in a variety of other applications where images are to be 
transmitted over a communication media 



58 



